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TITLE OF THE INVENTION 

.MURINE AND HUMAN CERBERUS-LIKE PROTEINS AND COMPOSITIONS COMPRISING THEM 

5 The present invention relates to novel members of the cerberus protein family. 

DNA encoding them, and processes for obtaining them. These proteins may be used to 
induce expression of factors in and/or differentiation of tissue and organs, and 
particularly, inducing formation, growth, differentiation, proliferation and/or maintenance 
of neural, endoderm and cardiac tissue. Thus, these proteins may be useful in the 

10 treatment of wounds, tumors, and in the enhancement and/or inhibition of cellular 
formation, growth, differentiation, proliferation and/or maintenance of other tissue and 
organs, for example, epidermal, pancreatic, liver, spleen, lung, kidney, brain and/or other 
tissue. These proteins may also be used for augmenting the activity of other tissue 
regenerating and differentiation factors, such as the BMPs. The protein has been named 

1 5 mammalian cerberus-like by the inventors. 

BACKGROUND OF THE INVENTION 

The search for the molecule or molecules responsible for the formation, 
proliferation, differentiation and maintenance of tissue and organs, such as neurons and 
20 related neuronal cells and tissues, has been extensive as there is a tremendous need for 
factors useful for treating conditions involving degradation or damage to these tissues. 
A Xenopus protein previously identified in embryos, cerberus, appears to be involved in 
induction of the head. Bouwmeesier et al.. Nature , 382:595-601 (1996). 

25 SUMMARY OF THE INVENTION 

The inventors herein have discovered novel mammalian members of the cerberus 
family of proteins and have surprisingly discovered that members of the cerberus protein 
family are able to induce, enhance and/or inhibit the formation, growth, proliferation, 
differentiation, maintenance of neurons and/or related neural cells and tissues such as 
30 Schwann cells, glial cells and astrocytes. Accordingly, the present invention provides 
methods for inducing formation of neurons and/or related neural cells and tissues such 
as Schwann cells, glial cells and asu-ocytes, comprising administering to progenitor cells 
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5 a composition comprismg at least one protein which is a member of the cerberus protein 
family. 

The present invention relates to a family of proteins designated as cerberus, which 
• appears to be a pioneer protein, with a 9 cysteine residue pattern, which is present in the 
embryo. In Xenopus, cerberus mRNA is expressed at low levels in the unfertilized egg. 
10 and zygotic transcripts start accumulating at early gastrula. Expression continues during 
gastrulation and early neurulation. rapidly declining during neurulation. Imponantly. 
cerberus expression starts about one hour after that of chordin, suggesting that cerberus 
could act downstream of the chordin signal. The cerberus domain of the organizer 
includes the leading edge of the most anterior organizer cells and extends into the lateral 
15 mesoderm. The leading edge gives rise to liver, pancreas and foregut in its midline, and 
the more lateral region gives rise to heart mesoderm at later stages of development. 

In preferred embodiments, the composition may comprise a protein having the 
amino acid sequence of SEQUENCE ID N0:2 beginning at amino acid 1. 18to 24,41. 
85 to 91 or 162, and ending at amino acid 241 or 272; or SEQUENCE ID NO:8 
20 beginning at amino acid 1. 18 to 25, 41, 85 to 91 or 162, and ending at amino acid 241 
or 267. In one embodiment, the method comprises administering the composition to a 
patient in vivo. Alternatively, the method may comprise administering the composition 
to cells in vitro and recovering neurons and/or related neural cells and tissues such as 
Schwann cells, glial cells and astrocytes, which may subsequently be administered to a 
25 patient. The composition may further comprise a suitable cairier for administration. 

The present invention also provides novel DNA sequences encoding novel 
members of the cerberus protein family. In particular embodiments, the presem 
invention provides novel DNA sequences encoding mammalian cerberus proteins such 
as murine and human cerberus. The nucleotide sequences, and the corresponding amino 
30 acid sequences encoded by these DNA sequences, are provided in the Sequence Listings. 
In particular, the present invention comprises isolated DNA sequence encoding a 
mammalian cerberus protein comprising a DNA sequence selected from the group 
consisting of: nucleotides beginning at #58, 109, 178. 313, 316, 319, 322, 325, 328 or 
541 and ending at #780 or 873 of SEQ ID NO: 1; nucleotides beginning at # l', 52. 55 
35 58, 61, 64, 67, 70, 73, 121, 256, 259, 262, 265, 268, 271 or 484 and endmg at #723 or 
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5 801 of SEQ ID N0:7; or nucleotides encoding amino acids beginning at #1, 18, 41, 85 
to 91 or 162 and ending at #241 or 272 of SEQ ED NO: 2; or amino acids beginning at 
# 1, 18to 25,41, 85 to91 or 162 and ending at #241 or 267 of SEQ ID NO: 8, as well as 
' fragments and variants of the above sequences which are readily obtainable from the 
above and which maintain cerberus activity. The present invention funher comprises 

10 sequences which hybridize to these sequences under stringent hybridization conditions 
and encode a protein which exhibits cerberus activity. 

It is expected that mammalian cerberus protein, as expressed by 
mammalian cells such as CHO cells, exists as a heterogeneous population of active 
species of cerberus protein with varying N-termini. Based in part upon the Von Heginje 

15 signal peptide prediction algorithm, the first 17 to 24 amino acids appear to be involved 
in signaling for the secretion of the mature peptide. It is expected that active species may 
optionally include the signal peptide and will include amino acid sequences beginning 
with amino acid residues #1,18,19, 20, 2 1 , 22, 23, 24 or 25 of SEQ ID N0:2 or SEQ ID 
NO:8. Thus, it is expected that DNA sequences encoding active mammalian cerberus 

20 proteins include those comprising nucleotides #109, 1 12, 1 15, 1 18, 121, 124, 127 or 130 
to #780 or 873 of SEQ ID NO: 1; or comprising nucleotides #1, 52, 55, 58, 61, 64, 67, 
70 or 73 to #723 or 801 of SEQ ID N0:7. Accordingly, active species of cerberus-like 
protein are expected to include those comprising amino acids #1, 18, 19, 20, 21, 22, 23, 
24or25 to #241 or 272 of SEQ ID N0:2; or amino acids # 1 , 18, 19, 20,21,22,23,24 

25 or 25 to #241 or 267 of SEQ ID NO:8. 

As described further herein, it is further expected that cerberus and cerberus-like 
proteins may be proteolyticaliy processed by cells to form further active species. For 
example, putative proteolytic processing sites for cleavage, which are typically 
characterized by the formula R-X-K/R-R, are found at amino acids 37 to 40 and 82 to 85 

30 of SEQ ID N0:2 or SEQ ID N0:8. Thus, it is expected that DNA sequences encoding 
active manunalian cerberus proteins include those comprising nucleotides #178 or 313 
to #780 or 873 of SEQ ID N0:1; and # 121 or 256 to #723 or 801 of SEQ ID NO:7. 
Accordingly, further active species of cerberus4ike protein are expected to include those 
comprising amino acids beginning at #41 or 86 and ending at #241 or 272 of SEQ ED 

3 
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5 NO: 2; or comprising amino acids beginning at #41 or 86 and ending at #241 or 267 of 
SEQ ID NO:8. 

The above sequences of SEQ ID NO: 1 and 2 are used to isolate and sequence the 
human Cerbems-likt DNA and amino acid sequences. 

hi yet another embodiment, the present invention comprises a method of altering 
10 the regulation of genes in a patiem in need of same comprising administering to said 
patient an effective amount of the above compositions. The alteration of regulation of 
neuronal genes may be accomplished by stimulating or inhibiting binding by cerberus 
proteins of receptor proteins, including bone morphogenetic proteins [BMPs]. Thus, the 
mammalian cerberus and c.rW//*. protein family may be capable of inducing 
15 formation of neural, bone, cartilage and other Ussue. The mammalian cerberus and 
cerberus-like protein family may also be capable of inhibiting, augmenting or otherwise 
affectmg the acUvity of its receptor molecules to which it binds, including molecules of 
the BMP family. 

In other embodiments, the present invention comprises vectors comprising the 
20 above DNA molecules in operative association with an expression control sequence 
therefor, as well as host cells transformed with these vectors. In yet other embodiments 
the presem mvention comprises methods for producing purified mammalian cerberus 
protems, novel mammalian cerberus proteins, and compositions containing the 
mammalian cerberus proteins. These methods may comprise the steps of: culturing a 

25 hostcelltransforxnedwithaDNAsequenceencodingamammaliancer^protemsuch 
as descnbed above; and recovering and purifying said mammalian cerben. protein from 
the culture medium. The presem invention funher comprises the purified mammalian 
cerberus polypeptide produced by the above methods, as well as purified mammalian 
cerberus polypeptides comprising an amino acid sequence encoded by the above DNA 

30 sequences. The proteins of the present invention may comprise the amino acid sequence 
begmmng at amino acid #1,18, 19, 20, 21. 22, 23. 24, 25, 86, 87, 88, 89. 90. 91 or 162 
andendingat#241or272ofSEQIDNO:2; #1,18.19,20.21.22.23.24.25.86 87 
88. 89. 90. 91 or 162 and ending at #241 or 267 of SEQ ID NO:8: or a mammalian 
cerberus protein having a molecular weight of about 20-30 kd. said protein comprising 
an ammo acid sequence highly homologous to the amino acid sequences of SEQ ID NO-2 
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5 or SEQ ID N0:8, and having the ability to regulate the transcription of one or more 
genes. One species of active cerberus-like protein is a mature peptide contemplated to 
comprise the amino acid sequence from amino acids 18 to 272 of SEQ ID N0:2; or 
amino acid 18 lo 267 of SEQ ID NO:8, each expected to have a molecular weight of 
about 28.6 kD. Another species of active cerberus-like protein is a cleaved peptide 

10 contemplated to comprise the amino acid sequence from amino acids 86 to 9 1 and ending 
at amino acid 272 of SEQ ID N0:2, particularly from 18 or 90 to 272 of SEQ ID NO:2, 
or from amino acids 86 to 91 ending at amino acid 267 of SEQ ID NO:8, particularly 
from 18 or 90 to 267 of SEQ ID NO:8. The manire polypeptide for each of these proteins 
is expected to have a molecular weight of about 20.7 kD. 

15 Description of Sequences 

SEQ ID NO: 1 nucleotide sequence of mammalian cerberus DNA, panicularly murine 
cerberus DNA. 

SEQ ID NO: 2 amino acid sequence of the mammalian cerberus protein encoded by 
SEQ ID NO: 1. 

20 SEQ ID NO: 3 to 5 are consensus nucleotide sequences of probes to the cerberus and 
cerberus-like pTOieins: 

SEQ ID NO: 6 is the genomic DNA sequence encoding human Cerberus-like protein. 
The symbol "N" indicates that the nucleotide residue may be any of A, C, T or G. 
SEQ ID NO: 7 is the cDNA sequence encoding human Cerberus-like protein. 
25 SEQ ID NO: 8 is the amino acids sequence of human Cerberus-like protein encoded by 
SEQ ID NO: 7. 

Description of ATCC Deposits 

An £. coli DH5a strain transformed with pGIMCerb, which comprises the 
mammalian cerberus coding sequence described in Sequence ID NO: 1 has been 
30 deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, 
MD 20852, and has been accorded the ATCC accession number 98347. 
Detailed Description of the Invention 

As used herein, the terms ''cerberus" or "cerberus-like'' are both used to signify 
the protein family which comprises the cerberus and cerberus-like proteins, ''cerberus 
35 or cerberus-like protein" refers to mammalian cerberus and cerberus-like proteins, such 



BNSCXXilD' -cWO 9901S53A1 1 > 



wo 99/01553 



PCTAJS98/1I462 



5 as the murine or human cerberus proteins, and other proteins which share sequence 
homology to the highly conserved cysteine pattern of the C-terminal ponion of the 
mammalian cerberus proteins. One specific member of the cerberus protein family is the 
murine cerberus-like proKm, having the amino acid sequence specified in SEQUENCE 
ID N0:2. as well as homologues of this protein found in other species; and other proteins 
10 which are closely related structurally and/or functionally to murine cerberus. It is also 
expected that cerberus related proteins also exist in other species, including family 
members in Xenopus, and Drosophila. C. elegms, zebrafish, as well as in all mammals, 
for example, rats, mice and humans. "Cerberus or cerberus-like proteins" also includes 
variants of the cerberus proteins, such as allelic variants or variants induced by 
15 mutagenesis or deletions, and fragments of cerberus proteins which variants and 
fragments retain cerberus activity. ^Cerberus and Cerberus-like" are also used to signify 
the family of proteins sharing structural and/or functional similarity, including those 
proteins which are described further herein. 

As used herein, the term "cerberus or cerberus-like activity" refers to one or more 
20 of the activities which are exhibited by the mammalian cerberus-like proteins of the 
present invention. In panicular. "cerberus or cerberus-like activity" includes the ability 
to induce, enhance and/or inhibit the formation, growth, proliferation, differentiation, 
maintenance of neurons and/or related neural cells and tissues such as brain cells, 
Schwann cells, glial cells and astrocytes. ^Cerberus or cerberus-like" activity also 
25 includes the ability to induce molecular markers of neuroendocrine or ectodemi tissue, 
such as OTX2, N-CAM. MASH, chromagranin, and AP2, as well as the ability to induce 
the formation of neurons and/or related neural cells and tissues such as brain cells, 
Schwann cells, glial cells and asu-ocytes. "Cerberus or cerberus-like activity" may also 
include the ability to regulate the interaction of ligands and their protein receptors. For 
30 example, "cerberus or cerberus-like activity" may include the ability to bind to one or 
more members of the bone morphogenetic protein [BMP] and/or wnt protein families, 
and thereby inhibit, augment or otherwise affect the activity of such molecules. 
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5 "Cerberus or cerberus-like activity" may further include the ability to regulate the 
formation, differentiation, proliferation and/or maintenance of other cells and/or tissue, 
for example connective tissue, organs and wound healing. In particular, "cerberus or 
■ cerberus-like activity" may include the ability to enhance and/or inhibit the formation, 
growth, proliferation, differentiation and/or maintenance of cardiac, spleen, liver, 

10 pancreas, stomach, kidney, lung and brain cells and tissue, as well as osteoblasts and 
bone, chondrocytes and cartilage, tendon, epidermis and muscle. ^'Cerberus and 
cerberus-like activity" also includes the activities of cerberus and cerberus-like protein 
in the assays described in the examples and specification herein. 

Cerberus and cerberus-like cDNA should be useful as a diagnostic tool (such as 

15 through use of antibodies in assays for proteins in cell lines or use of oligonucleotides as 
primers in a PGR test to amplify those with sequence similarities to the oligonucleotide 
primer, and to determine how much cerberus is present). Cerberus might act upon its 
target cells via its 9wn receptor. Cerberus, therefore, may be useful for the isolation of 
that receptor. In addition, cerberus or its receptor should prove useful as a diagnostic 

20 probe for certain tumor types. Thus, cerberus, its receptor, or antibodies to either may 
be potent agonists or antagonists which may be clinically useful. In addition, complexes 
of cerberus and its receptor, cerberus and antibodies to it, or cerberus receptor and 
antibodies to it, may each be useful in a number of in vitro j ex vivo or clinical uses. 

The present invention also includes protein variants and functional fragments of 

25 the amino acid sequence of the manunalian cerberus protein shown in SEQ ID NO: 2 
which retain cerberus activity. The present invention also includes antibodies to a 
purified mammalian cerberus protein such as the above. The compositions of the present 
invention comprise a therapeutic amount of at least one of the above manunalian 
cerberus proteins. It is expected that such protein variants and functional fragments of 

30 cerberus or cerberus-like proteins will include amino acid sequences which share 
significant homology with the amino acid sequence of SEQ ID NO: 2, most preferably 
at least 80% or 90% amino acid identity. The variants and functional fragments which 
retain cerberus-like activity are expected to include those which retain the cysteine 
pattern found in the SEQ ID NO: 2. For example, a truncated polypeptide comprising 

35 amino acids #162 to #241 of SEQ ID NO: 2; or from #162 to #241 of SEQ ID N0:8 will 
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5 each retain the full 9 cysteine pattern found in the carboxy terminal portion of the 
Cerberus and cerberus-like sequences of SEQ ID NO: 2 and SEQ ID N0:8, respectively. 

In yet another embodiment, the present invention comprises a method of altering 
the regulation of genes in a patient in need of same comprising administering to said 
patient an effective amount of the above compositions. For example, the alteration of 
10 regulation of neuronal genes may be accomphshed by stimulating or inhibiting binding 
of receptor proteins, for example, binding between tiie mammalian cerberus protein and 
as receptor protein, such as a wm protein, or a BMP protein. Thus, cerberus proteins 
may be capable of regulating tiie binding interaction of ligands to their receptor proteins, 
as well as the interaction of transcriptional factors on cells. 
15 The present invention also encompasses hybrid or fusion vectors comprising the 

codmg DNA sequences of the present invention and other cerberus encoding sequences 
hnked to a tissue specific or inducible regulatory sequence, such as a promoter or 
operator. In a preferred embodiment of the invention, the coding sequence for 
mammalian cerberus-like protein is operably linked to one or more promoters, enhance,. 
20 and/or other regulatory elements from genes which are selectively expressed in neurons 
and/or related neural cells and tissues such as Schwann cells, glial cells and astrocytes 
For example, ti.e promoter of the GFAP gene, which is known to be expressed in 
astrocytes and neuronal cells; and the promoter of the 07X2 gene, which is known to be 
expressed .n the anterior brain, are suitable for Uie tissue specific production of cerberus 
25 Additionally, the DNA sequence encoding mammalian cerberus may be operatively 
Imked to one or more regulatory sequences from GFAP or 07X2 proteins, as well as 
otiier proteins which are selectively produced in neurons and/or related neural cells and 
tissues such as Schwann cells, glial cells and astrocytes. 

In other preferred embodiments of the invention, the coding sequence for 
mammalian cerberus-li,e protein is operably linked to the promoter isolated from other 

genes, organs or cells ofinterest. Vectors using other tissue-selective regulatoiy elements 
and inducible regulatory elements may also be useful for the selective or inducible 
expression of the mammalian cerberus-like proteins of the present invention 

Another aspect of the invention provides pharmaceutical compositions containing 
35 a therapeutically effective amount of mammalian cerberus-like protein m a 
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5 pharmaceuiically acceptable vehicle or carrier. These compositions of the invention may 
be used in the formation of neurons and/or related neural cells and tissues such as 
Schwann cells, glial cells and astrocytes, as well as liver, pancreas, lung, heart, kidney, 
' spleen, stomach, cardiac tissue and cells, as well as connective tissue and cells, including 
osteocytes, chondrocytes, myocytes, tendon cells, epidermal cells, and adipocytes. These 

10 compositions may further be utilized in order to enhance and/or inhibit the formation, 
growth, proliferation, differentiation and/or maintenance of bone, osteoblasts, cartilage, 
chondrocytes, beta cells and other cell types typically found in the islets of Langerhans 
or other pancreatic cells, as well as other organ tissues such as epidermis, spleen, brain, 
lung and kidney tissue. The compositions comprising mammalian cerberus-like protein 

15 may be used to treat precursor or stem cells, such as endoderm cells, which are able to 
differentiate into cells which comprise differentiated tissue or organs, such as cardiac and 
neural cells, in order to enhance the formation, differentiation, proliferation and/or 
maintenance of such cells, tissue or organs. Methods for forming and maintaining 
pancreatic cells are described, for example, in WO93/00441, the disclosure of which is 

20 hereby incorporated herein by reference. In addition, the compositions may be used to 
regulate embryonic development, for instance, by affecting the development of 
embryonic cells and tissue into the endodermal phenotype. 

The compositions of the invention may comprise, in addition to a mammalian 
cerberus-like protein, other therapeutically useful agents including growth factors such 

25 as epidermal growth factor (EGF), fibroblast growth factor (FGF), transforming growth 
factor (TGF-a and TGF-P), Writs, hedgehogs, including sonic, Indian and desert 
hedgehogs, activins, inhibins, bone morphogenetic proteins (BMP), and insulin-like 
growth factor (IGF). The compositions may also include an appropriate matrix, for 
instance, for supporting the composition and providing a surface for ingrowth of neurons 

30 and/or related neural cells and tissues such as Schwann cells, glial cells and astrocytes, 
or for other tissue or cell growth. The maurix may provide slow release of the 
mammalian cerberus-like protein and/or the appropriate environment for presentation 
thereof. 

The mammalian cerberus-like protein containing compositions may be employed 
35 in methods for treating a number of tissue defects, and healing and maintenance of 
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5 various types of tissues and wounds. The tissues and wounds which may be treated 
include repair or induction of neurons and/or related neural cells and tissues such as 
Schwann ceUs. glial cells and astrocytes. It also includes treatment of cardiac, liver, 
■ pancreas, spleen, lung, kidney, brain and stomach tissue and may also include canilage. 
epidermis, muscle, including cardiac muscle, other connective tissue, such as bone,' 
10 tendon and ligament and other tissues and wounds. These methods, according to the 
invenuon. entail administering to a patient needing such tissue formation, wound healing 
or tissue repair, an effective amount of mammalian cerberus protein. The mammalian 
cerberus-like containing compositions may also be used to treat or prevent degenerative 
nerve conditions such as Parkinson's Disease, Alzheimer's Disease and Lou Gehrig's 
15 Disease, as well as other degenerative nerve diseases, and oUier conditions involving 
defects of neural tissue. The compositions may also be useful to treat other conditions 
such as osteoporosis, rheumatoid arthritis, osteoarthritis, and other abnonnalities of 
connective tissue, or of other organs or tissues, such as muscle, pancreas, liver, spleen 
lung, cardiac, brain, and kidney tissue, and other tissues and organs. These methods may 
20 also entail the administration of a protein of the invention in conjunction with 
administration of at least one other protein, for example growth factors including EGF 
FGF, TGF-a. TGF-p. BMP, Wnt. hedgehogs, including sonic. Indian and desert 
hedgehogs, activin. inhibin and IGF. In a particular embodiment of the present invention 
the mammalian cerberus-like gene or protein may be used to augment the activities of 
25 BMPs or other members of the TGF-P superfamily. 

Still a further aspect of the invention are DNA sequences coding for expression 
of mammalian cerberus-Uke protein. Such sequences include the sequence of nucleotides 
a 5' to 3- direction illustrated in SEQ ID NO:I or SEQ ID NO:7. DNA sequences 
which, but for the degeneracy of the genetic code, are identical to the DNA sequence 
30 SEQ ID NO: 1 or SEQ ID N0:7, and encode the protein of SEQ ID N0:2 or SEQ ID 
N0:8. Further included in the present invention are DNA sequences which hybridize 
under stnngent conditions with the DNA sequence of SEQ ID NO: I or SEQ ID NO T and 
encode a protein having cerberus-like activity. Preferred DNA sequences include those 
which hybndize under stringent conditions [see. T. Maniatis et al. Molecular Clnninp (A 
35 i^teoDLMMual), Cold Spring Harbor U^^^^^^ 
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5 generally preferred that such DN A sequences encode a polypeptide which is at least about 
80% homologous, and more preferably at least about 90% homologous, to the mature 
mammalian cerbems-like amino acid sequence shown in SEQ ID NO:2 or SEQ ID NO:8. 
■ Further, allelic or other variations of the sequences of SEQ ID NO: 1 or SEQ ID N0:2, 
whether such nucleotide changes result in changes in the peptide sequence or not, but 

10 where the peptide sequence still has cerberus-like activity, are also included in the present 
invention. The present invention also includes functional fragments of the DNA 
sequence of manmialian cerberus-like proteins shown in SEQ ID NO: 1 or SEQ ID NO:7 
which encode a polypeptide which retains the activity of cerberus-like protein. The 
determination whether a particular variant or fragment of the mammalian cerberus-like 

15 protein of the present invention, such as those shown in SEQ ID NO:2 or SEQ ID NO:8, 
maintain cerberus-like activity, is routinely performed using the assays described in the 
examples and specification herein. 

The DNA sequences of the present invention are useful, for example, as probes 
for the detection of nxElNA encoding other cerberus-like proteins in a given cell 

20 population. The DNA sequences may also be useful for preparing vectors for gene 
therapy applications as described below. 

A further aspect of the invention includes vectors comprising a DNA sequence 
as described above in operative association with an expression control sequence therefor. 
These vectors may be employed in a novel process for producing a recombinant 

25 mammalian cerberus-like protein of the invention in which a cell line transformed with 
a DNA sequence encoding mammalian cerberus-like protein in operative association with 
an expression control sequence therefor, is cultured in a suitable culture medium and 
mammalian cerberus-like protein is recovered and purified therefrom. This process may 
employ a number of known cells both prokaryotic and eukaryotic as host cells for 

30 expression of the polypeptide. The vectors may also be used in gene therapy applications. 
In such use, the vectors may be transfected into the cells of a patient ex vivo, and the cells 
may be reintroduced into a patient, or the vectors may be introduced into a patient in vivo 
through targeted transfection. Alternatively, homologous cerberus gene expression may 
be upregulated by known recombination techniques to insert high expression regulatory 

35 elements into the genome in proximity to the cerberus coding sequence disclosed herein. 
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5 In a preferred embodiment of the invention, vectors are prepared using one or 

more non-native regulatory elements, such as promoters and/or enhancers operatively 
associated with the codmg sequence for mammalian cer/,.r«..//;te. in order to achieve 
■ expression of mammalian cerberus-like in desired cell tissue and/or at a desired time in 
development. For example, a vector may be constructed using the promoter element 
10 from genes, which is known to be constitutively expressed in neuronal development. By 
operatively associating the promoter from suitable genes with the coding sequence for 
cerberus-like, and transforming suitable cells, such as neuronal stem cells, one can 
express mammalian cerberus-like in these cells, thus promoting the desired effects of 
formation, growth, proliferation, differentiation and/or maintenance of cells such as 
15 neurons and/or related neural cells and tissues such as Schwann cells, glial cells and 
astrocytes, either in in vitro culture or in vivo. 

Still a further aspect of the invention are mammalian cerberus-like proteins or 
polypeptides. Such polypeptides are characterized by having an amino acid sequence 
mcluding the sequence illustrated in SEQ ID NO:2 or SEQ ID NO:8, variants of the 
20 amino acid sequence of SEQ ID N0:2 or SEQ ID NO:8, including naturally occurring 
allelic variants, and other variants in which the protein retains cerberus-like activity, for 
example, the abihty to enhance and/or mhibit the formation, growth, proliferation 
differentiation and/or maintenance of neurons and/or related neural cells and tissues such 
as Schwann cells, glial cells and astrocytes, and may also affect the formation, growth 
25 prohferation, differentiation and/or maintenance of pancreas, liver, stomach, cardiac or 
other tissue such as bone, osteocytes. chondrocytes and/or cartilage tissue, or other organ 
tissue, such as spleen, lung, brain and kidney tissue, characteristic of cerberus-like 
protein. Preferred polypeptides include a polypeptide which is at least about 80% and 
more preferably at least about 90% homologous to the mature mammalian cerberus-like 
30 ammo acid sequence shown in SEQ ID N0:2 or SEQ ID N0:8. Further, allelic or other 
variations of the sequences of SEQ ID NO:2 or SEQ ID N0:8. whether such amino acid 
changes are induced by mutagenesis, chemical alteration, or by alteration of DNA 
sequence used to produce the polypeptide, where the peptide sequence still has cerberus- 
Uke activity, are also included in the present invenuon. The present invention also 
35 includes fragments of the amino acid sequence of mammalian cerberus-like shown in 
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5 SEQ ID N0:2 or SEQ ID N0:8 which retain the activity of cerberus-like protein. One 
skilled in the art can readily produce such variations and fragments of mammalian 
cerberuS'like protein using techniques known in the an, and can readily assay them for 
activity, as described herein. 

The purified proteins of the present inventions may be used to generate 

10 antibodies, either monoclonal or polyclonal, to mammalian cerberus-like proteins and/or 
other related proteins, using methods that are known in the art of antibody production. 
Thus, the present invention also includes antibodies to manmialian cerberus and/or other 
cerberus-like proteins. The antibodies may be useful for purification of mammalian 
cerberus-like proteins, or for inhibiting or preventing the effects of cerberus proteins 

15 either in vitro or in vivo. The manmialian cerberus-like proteins may be useful for 
inducing the growth and/or differentiation of embryonic cells and/or stem cells. Thus, 
the proteins or compositions of the present invention may also be useful for treating cell 
populations, such as embryonic cells or stem cell populations, to enhance, enrich or to 
inhibit the growth and/or differentiation of the cells. For example, the mammalian 

20 cerberus-like proteins may be useful for treating cell populations to enhance and/or 
inhibit the formation, differentiation, proliferation and/or maintenance of neurons and/or 
related neural cells and tissues such as Schwann cells, glial cells and astrocytes and/or 
other cells and tissue. The treated cell populations may be useful for, among other things, 
gene therapy applications, as described below. Thus» the proteins of the present invention 

25 may be useful in wound healing, tissue and organ repair and regeneration processes, as 
well as in differentiation of tissue, for example in embryonic development. In particular, 
it has been observed by the inventors that the manmialian cerberus-like protein may be 
useful for the induction, formation, growth, differentiation, proliferation and/or 
maintenance and repair of neurons and/or related neural cells and tissues such as 

30 Schwann cells, glial cells and asu-ocytes. The cerberus-like proteins are normally present 
as secreted proteins, and have been demonstrated to have effects on the growth and 
differentiation of neuronal and other neural cell and tissue types. Thus, these proteins, 
and compositions containing them, may be useful in the treatment of nerve and brain 
disorders, such as Parkinson's disease, Alzheimer's disease, and in the enhancement 

35 and/or inhibition of cellular formation, growth, differentiation, proliferation and/or 
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5 maintenance, for example formation of neurons and/or related neural cells and tissues 
such as Schwann cells, glial cells and astrocytes. 

The mammalian cerberus-like proteins provided herein include factors encoded 
• by the sequences similar to those of SEQ ID NO:l or SEQ ID NO:7, but into which 
modifications or deletions are naturally provided (e.g. allelic variations in the nucleotide 
10 sequence which may result in amino acid changes in the polypeptide) or deliberately 
engineered. For example, synthetic polypeptides may wholly or partially duplicate 
continuous sequences of the amino acid residues of SEQ ID N0:2 or SEQ ID N0:8. 
These sequences, by virtue of sharing primary, secondary, or tertiary structural and 
conformational characteristics with mammalian cerberus-like polypeptides of SEQ ID 
15 N0:2 or SEQ ID NO:8 may possess biological properties in common therewith. Thus, 
these modifications and deletions of the native maimnalian cerberus-like may be 
employed as biologically active substitutes for naturally-occuning mammalian cerberus- 
like polypeptides in therapeutic processes. It can be readily determined whether a given 
variant or fragmem of mammalian cerberus-like protein maintains the biological activity 
20 of Cerberus by subjecting both mammalian cerberus-like and tiie variant or fragment of 
mammalian cerberus-like to the assays described herein. 

Otiier specific mutations of the sequences of mammalian cerberus-like proteins 
described herein involve modifications of glycosylation sites. These modifications may 
involve O-linked or N-linked glycosylation sites. For instance, the absence of 
25 glycosylation or only partial glycosylation results from ammo acid substitution or deletion 
at asparagine-linked glycosylation recognition sites. The asparagine-linked glycosylation 
recognition sites comprise tripeptide sequences which are specifically recognized by 
appropriate cellular glycosylation enzymes. These tripeptide sequences are either 
asparagine-X-threonine or asparagine-X-serine, where X is usually any amino acid. A 
30 variety of amino acid substitutions or deletions at one or both of the first or third amino 
acid positions of a glycosylation recognition site (and/or amino acid deletion at the 
second position) results in non-glycosylation at the modified tripeptide sequence. Such 
variants of mammalian cerberus-like are within the present invention. Additionally, 
bacterial expression of mammalian cerberus-like proteins will result in production of a 
35 non-glycosylated protein, even if the glycosylation sites are left unmodified. Such 
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5 bacterially produced versions of mammalian cerberus-like are within the present 
invention. 

The present invention also encompasses the novel DNA sequences, free of 
' association with DNA sequences encoding other proteinaceous materials, and coding for 
expression of maimnalian cerberus-like proteins. These DNA sequences include those 

10 depicted in SEQ ID NO: 1 or SEQ ID N0:7 in a 5' to 3' direction and those sequences 
which hybridize thereto under stringent hybridization conditions [for example, 0. IX SSC, 
0.1% SDS at 65^*0; see, T. Maniatis et al, Molecular Cloning (A Laboratory Manual) , 
Cold Spring Harbor Laboratory (1982), pages 387 to 389] and encode a protein having 
cerberus-like activity. Stringent hybridization conditions also refer to initial low 

15 stringency hybridization conditions, followed by higher stringency wash conditions. 
These DNA sequences also include those which comprise variants and fragments of the 
DNA sequence of SEQ ID NO:l SEQ ID NO:7 which hybridize thereto under stringent 
hybridization conditions and encode a protein having cerberus-like activity. 

Similarly, DNA sequences which code for mammalian cerberus-like proteins 

20 coded for by the sequences of SEQ ID NO: 1 or SEQ ID NO:7, or mammalian cerberus- 
like proteins which comprise the amino acid sequence of SEQ ID NO:2 or SEQ ID NO:8, 
but which differ in codon sequence due to the degeneracies of the genetic code or allelic 
variations (naturally-occurring base changes in the species population which may or may 
not result in an amino acid change) also encode the novel factors described herein. 

25 Variations in the DNA sequences of SEQ ID NO: 1 or SEQ ID NO:7 which are caused 
by point mutations or by induced modifications (including insertion, deletion, and 
substitution) to enhance the activity, half-life or production of the polypeptides encoded 
are also encompassed in the invention. 

Another aspect of the present invention provides a novel method for producing 

30 manmialian cerberus-like proteins. The method of the present invention involves 
culturing a suitable cell line, which has been transformed with a DNA sequence encoding 
a manmialian cerberus-like protein of the invention, under the control of known 
regulatory sequences. The transformed host cells are cultured and the mammalian 
cerberus-like proteins recovered and purified from the culture medium. The purified 
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5 proteins are substantially free from other proteins wuh which they are co-produced as 
well as from other contaminants. 

Suitable cells or cell lines may be mammalian cells, such as Chinese hamster 
ovao^ cells (CHO). The selection of suitable mammalian host cells and methods for 
transformation, culture, amplification, screening, product production and purification are 
10 known in the art. See, e.g.. Gething and Sambrook, Nature. 293:620-625 (1981), or 
altemauvely. Kaufman et al, Mol. Cell. Biol , 5(7):1750-1759 (1985) or Howley et al, 
U.S. Patem 4,419,446. Another suitable mammalian cell line, which is described in the 
accompanying examples, is the monkey COS-1 cell hne. The mammalian cell CV-1 may 
also be suitable. 

15 Bacterial cells may also be suitable hosts. For example, the various strains of 

E. coli (e.g., HBIOl, MC1061) are well-known as host cells in the field of 
biotechnology. Various strains of B. subtilis. Pseudomonas. other bacilli and the like 
may also be employed in this method. For expression of the protein in bacterial cells. 
DNA encoding the propeptide of cerberus-like is generally not necessary. 
20 Many su-ains of yeast cells known to those skilled in the art may also be available 

as host cells for expression of the polypeptides of the present invention. Additionally 
where desired, insect cells may be utilized as host cells in the method of the present 
invention. See, e.g. Miller et al. Genetic Ennneerin., 8:277-298 (Plenum Press 1986) 
and references cited therein. 

25 Another aspect of the present invention provides vectors for use in the method of 

expression of these novel mammalian cerberus-Uke polypeptides. Preferably the vectors 
contain the full novel DNA sequences described above which encode the novel factors 
of the invention. Additionally, the vectors contain appropriate expression control 
sequences permitting expression of the cerberus-like protein sequences. Alternatively 
vectors incon^orating modified sequences as described above are also embodiments of 
the present invention. Additionally, the sequence of SEQ ID N0:1. SEQ ID NO-7 or 
other sequences encoding mammalian cerberus-like proteins could be manipulated to 
express a mature mammalian protein by deleting mammalian cerberus-like 

propeptide sequences and replacing them with sequences encoding the complete 
35 propeptides of other c.rW/.7:. proteins or other suitable propeptides. Thus the 
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5 present invention includes chimeric DNA molecules encoding a propepiide from a 
member of the cerberus-like family linked in correct reading frame to a DNA sequence 
encoding a mammalian cerberus-like polypeptide. 

The vectors may be employed in the method of u-ansforming ceU lines and contain 
selected regulatory sequences in operative association with the DNA coding sequences 

10 of the invention which are enable of directing the replication and expression thereof in 
selected host cells. Regulatory sequences for such vectors are known to those skilled in 
the art and may be selected depending upon the host cells. Such selection is routine and 
does not form part of the present invention. 

In order to produce rat, human or other mammalian cerberus-like proteins, the 

15 DNA encoding it is transferred into an appropriate expression vector and introduced into 
mammalian cells or other preferred eukaryotic or prokaryotic hosts by conventional 
genetic engineering techniques. The preferred expression system for biologically active 
recombinant mammalian cerberus-like is contemplated to be stably transformed 
mammalian cells. 

20 One skilled in the art can construct mammalian expression vectors by employing 

the sequence of SEQ ID N0:1, SEQ ID N0:7, or other DNA sequences encoding 
cerberus-like proteins or other modified sequences and known vectors, such as pCD 
[Okayama et al., Mol. Cell Biol. . 2:161-170 (1982)], pJL3, pJL4 [Gough et al., EMBO 
4:645-653 (1985)] and pMT2 CXM. 

25 The mammalian expression vector pMT2 CXM is a derivative of p9 1023(b) 

(Wong et al., Science 228:810-815, 1985) differing from the latter in that it contains the 
ampicillin resistance gene in place of the tetracycline resistance gene and further contains 
a Xhol site for insertion of cDNA clones. The functional elements of pMT2 CXM have 
been described (Kaufman, R.J., 1985, Proc. Natl. Acad. Sci. USA 82:689-693) and 

30 include the adenovirus VA genes, the SV40 origin of replication including the 72 bp 
enhancer, the adenovirus major late promoter including a 5' splice site and the majority 
of the adenovirus uipartite leader sequence present on adenovirus late niRNAs, a 3* splice 
acceptor site, a DHFR insert, the SV40 early polyadenylation site (S V40), and pBR322 
sequences needed for propagation in E. coli . 
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5 Plasmid pMT2 CXM is obtained by EcoRI digestion of pMT2-VWF, which has 

been deposited with the American Type Culture Collection (ATCC), Rockville, MD 
(USA) under accession number ATCC 67122. EcoRI digestion excises the cDNA insert 
■ present in pMT2-VWF, yielding pMT2 in linear fonn which can be ligated and used to 
transform E. coH HB 101 or DH-5 to ampicillin resistance. Plasmid pMT2 DNA can be 
10 prepared by conventional methods. pMT2 CXM is then constructed using loopout/in 
mutagenesis [Morinaga, et al.. Biotechnology 84: 636 (1984). This removes bases 1075 
to 1 145 relative to the Hind ffl site near the S V40 origin of replication and enhancer 
sequences of pMT2. In addition it inserts the following sequence: 
5* PO-CATGGGCAGCTCGAG-3' 
15 at nucleotide 1145. This sequence contains the recognition site for the restriction 
endonuclease Xho I. A derivative of pMT2CXM. termed pMT23, contains recognition 
sues for the restricuon endonucleases Psd, Eco RI. Sail and Xhol. Plasmid pMT2 CXM 
and pMT23 DNA may be prepared by conventional methods. 

PEMC2P1 derived from pMT21 may also be suitable in practice of the invention 
20 PMT21 isderivedfrompMT2whichisderivedfrompMT2-VWF. As described above 
EcoRI digestion excises the cDNA insert present in pMT-VWF, yielding pMT2 in linear 
form which can be ligated and used to transform E. CoH HB 101 or DH-5 to ampicillin 
resistance. Plasmid pMT2 DNA can be prepared by conventional methods. 

PMT21 is derived from pMT2 through the following two modifications. First 
25 76 bp of the 5' untranslated region of the DHFR cDNA including a stretch of 19 G 
residues from G/C tailing for cDNA cloning is deleted. In this process, a Xhol site is 
inserted to obtain the following sequence immediately upstream from DHFR- 5' - 

CTGCAGGCGAGCCTGMITCCTCGAGCCATCAIG-3' 
30 ^^^^ EcoRI Xhol 

Second, a unique Clal site is introduced by digestion with EcoRV and Xbal, treatment 
with Klenow fragment of DNA polymerase I. and ligation to a Clal linker 
(CATCGATG). This deletes a 250 bp segment from the adenovirus associated RNA 
(VAD region but does not interfere with VAI RNA gene expression or function pMT21 
35 IS digested with EcoRI and Xhol, and used to derive the vector pEMC2B 1 
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5 A portion of the EMCV leader is obtained from pMT2-ECATl [S.K. Jung, et al, 

J. Virol 63: 165 M660( 1989)1 by digestion with Eco RI and PstI, resulting in a 2752 bp 
fragment. This firagment is digested with TaqI yielding an Eco RI-TaqI fragment of 508 
bp which is purified by electrophoresis on low melting agarose gel. A 68 bp adapter and 
its complementary strand are synthesized with a 5' TaqI protruding end and a 3' Xhol 
10 protruding end which has the following sequence: 

5'CGAGGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTC 

GTTT 
TaqI 

15 

GAAAAACACGATTGC-3' 
Xhol 

This sequence matches the EMC vims leader sequence from nucleotide 763 to 827. It 
20 also changes the ATG at position 10 within the EMC virus leader to an ATT and is 
followed by a Xhol site. A three way ligation of the pMT21 Eco Rl-Xhol fragment, the 
EMC virus EcoRI-TaqI fragment, and the 68 bp 

oligonucleotide adapter Taql-Xhol adapter resulting in the vector pEMC2pi. 

This vector contains the S V40 origin of replication and enhancer, the adenovirus major 

25 late promoter, a cDNA copy of the majority of the adenovirus tripartite leader sequence, 
a small hybrid intervening sequence, an S V40 polyadenylation signal and the adenovirus 
VA I gene, DHFR and P-lactamase markers and an EMC sequence, in appropriate 
relationships to direct the high level expression of the desired cDNA in manunalian cells. 
The construction of vectors may involve modification of the manunalian cerberus 

30 DNA sequences. For instance, mammalian cerberus cDNA can be modified by removing 
the non-coding nucleotides on the 5* and 3* ends of the coding region. The deleted non- 
coding nucleotides may or may not be replaced by other sequences known to be 
beneficial for expression. These vectors are transformed into appropriate host cells for 
expression of mammalian cerberus proteins. Additionally, the sequence of SEQ ID NO: 1 

35 other sequences encoding manunalian cerberus proteins can be manipulated to express 
a mature mammalian cerberus protein by deleting mammalian cerberus encoding 
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5 propeptide sequences and replacing them with sequences encoding the complete 
propeptides of other proteins. 

One skilled in the an can manipulate the sequences of SEQ ID NO: 1 or SEQ ID 
"N0:7 by eliminating or replacing the mammalian regulatory sequences flanking the 
coding sequence with bacterial sequences'to create bacterial vectors for intracellular or 
10 extracellular expression by bacterial cells. For example, the coding sequences could be 
further manipulated (e.g. ligated to other known linkers or modified by deleting non- 
coding sequences therefrom or altering nucleotides therein by other known techniques). 
The modified mammalian cerberus coding sequence could then be inserted into a known 
bacterial vector using procedures such as described in T. Taniguchi ei al., Proc. Natl 
15 Acad^SciJJSA. 77:5230-5233 (1980). This exemplary bacterial vector could then be 
transformed into bacterial host cells and a protein expressed thereby. For a strategy for 
producing extracellular expression of mammalian cerberus proteins in bacterial cells, see, 
e.g. European patent application EPA 177,343. 

Similar manipulations can be perfoimed for the construction of an insect vector 
20 [See, e.g. procedures described in published European patent application 155,476] for 
expression in insect cells. A yeast vector could also be constructed employing yeast 
regulatory sequences for intracellular or extracellular expression of the factors of the 
present invention by yeast cells. (See. e.g.. procedures described in published PCT 
application WO86/00639 and European patent application EPA 123,289]. 
25 A method for producing high levels of a mammalian cerberus protein of the 

invention in mammalian cells may involve the construction of cells containing multiple 
copies of the heterologous mammalian cerberus gene. The heterologous gene is linked 
to an amplifiable marker, e.g. the dihydrofolate reductase (DHFR) gene for which cells 
containing increased gene copies can be selected for propagation in increasing 
30 concentrations of methotrexate (MTX) according to the procedures of Kaufman and 
Sharp, J. Mol. Biol. . 159:601-629(1982). This approach can be employed with a number 
of different cell types. 

For example, a plasmid containmg a DNA sequence for a mammalian cerberus 
protein of the invention in operative association with other plasmid sequences enabling 
35 expression thereof and the DHFR expression plasmid pAdA26SV(A)3 [Kaufman and 
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5 Sharp, Mol. Cell Biol. , 2: 1 304 ( 1 982)] can be co-introduced into DHFR-deficient CHO 
cells, DUKX-Bn, by various methods including calcium phosphate coprecipiiation and 
transfection, electroporation or protoplast fusion. DHFR expressing transformants are 
' selected for growth in alpha media with dialyzed fetal calf serum, and subsequently 
selected for amplification by growth in increasing concentrations of MTX (e.g. sequential 

10 steps in 0.02, 0.2, 1,0 and 5uM MTX) as described in Kaufman et al., Mol Cell Biol. , 
5:1750 (1983). Transformants are cloned, and biologically active mammalian cerberus 
expression is monitored by assay in one of the assays described in the examples and 
specification. Mammalian cerberus protein expression should increase with increasing 
levels of MTX resistance. Mammalian cerberus polypeptides are characterized using 

15 standard techniques known in the art such as pulse labeling with [35S] methionine or 
cysteine and polyacrylamide gel electrophoresis. Similar procedures can be followed to 
produce other related cerberus proteins. 

A mammalian cerberus protein of the present invention, which demonstrates 
cerberus activity, has application in the induction, formation, growth, differentiation, 

20 proliferation and/or maintenance and healing of cells and tissues such as neurons and/or 
related neural cells and tissues such as Schwann cells, glial cells and asu-ocyies, and other 
tissues, in humans and other animals. Such a preparation employing mammalian 
cerberus protein may have prophylactic use in treatment of Parkinson's disease, 
Alzheimer's disease, as well as preventing neural tumors, and other neural tissue 

25 disorders. De novo formation of neurons and/or related neural cells and tissues such as 
Schwann cells, glial cells and astrocytes, and other cells of neural phenotype, induced by 
a cerberus protein contributes to the repair of congenital, trauma induced, or oncologic 
tissue defects or conditions. Mammalian cerberus protein may also be used in the 
treatment of neural disease, and in other tissue and organ repair processes. Such agents 

30 may provide an environment to attract suitable stem cells, stimulate growth and 
proliferation of neuron-forming cells or induce differentiation of progenitors of 
neuron-forming cells, and may also suppon the regeneration of other tissues and organs. 
Mammalian cerberus polypeptides of the invention may also be useful in the treatment 
of organ disorders. 
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5 The proteins of the invention may also be used in wound healing and in related 

tissue repair. The types of wounds include, but are not limited to bums, incisions and 
ulcers. (See, e.g. PCT Publication WO84/01 106 for discussion of wound healing and 
related tissue repair). It is further contemplated that proteins of the invention may 
increase neuronal, astrocytic and/or glial cell survival and therefore be useful in 

10 transplantation and u^eatment of conditions exhibiting a decrease in neuronal survival and 
repair. The proteins of the invention may further be useful for the treatment of conditions 
related to other types of tissue, such as epidermis, muscle, connective tissue, such as 
bone, cartilage, tendon and hgament. and other organs such as pancreas, liver, spleen, 
lung, cardiac, brain and kidney tissue. The proteins of the present invention may also 

15 have value as a dietary supplement, or as additives for cell culture media. For this use. 
the proteins may be used in intact form, or may be predigested to provide a more readily 
absorbed supplement. 

The proteins of the invention may also have other useful propenies characteristic 
of the Cerberus family of proteins. Such properties include angiogenic, chemotactic 
20 and/or chemoattractant properties, and effects on cells including differentiation responses, 
cell proliferative responses and responses involving cell adhesion, migration and 
extracellular matrices. These properties make the proteins of the invention potential 
agents for wound healing, reduction of fibrosis and reduction of scar tissue formation. 
The proteins of the invention may also be useful for the induction of formation of cells 
25 capable of secreting valuable hormones, including endocrine or exocrine hormones. 

A further aspect of the invention is a therapeutic method and composition for 
treating disorders of neurons and/or related neural cells and tissues such as Schwann 
cells, glial cells and astrocytes, and other conditions related to neuronal and neural tissue 
disorders or diseases. The invention further comprises therapeutic methods and 
30 compositions for wound healing and tissue repair. Such compositions comprise a 
therapeutically effective amount of at least one mammalian cerberus protein of the 
present invention in admixture with a pharmaceutically acceptable vehicle, carrier or 
matrix. It is further contemplated that composiuons of the invention may increase 
neuronal, glial cell and astrocyte survival and therefore be useful in transplantation and 
35 treatment of conditions exhibiting a decrease in neuronal survival. 
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5 II is exf^ted thai cerbenis and cerberiis4ike proteins may exist in nature as 

homodimers or heterodimers. To promote the formation of dimers of cerberus-like 
protein with increased stability, one can genetically engineer the DN A sequence of SEQ 
" ID NO: 1 or SEQ ID NO: 7 to provide one or more additional cysteine residues to increase 
potential dimer formation. The resulting DNA sequence would be capable of producing 

10 a "cysteine added variant" of cerberus-like protein. Alternatively, one can produce 
"cysteine added variants" of cerberus-like proteins by altering the sequence of the protein 
at the amino acid level, for example, by altering the amino acid sequences of one or more 
amino acid residues to Cys. Production of "cysteine added variants" of proteins is 
described in United States Patent 5,166322, the disclosure of which is hereby 

15 incorporated by reference. 

It is expected that the proteins of the invention may act in concert with or perhaps 
synergistically with other related proteins and growth factors. Such combinations may 
comprise separate molecules of the cerberus or cerberus-like proteins and other proteins 
or heteromolecules comprised of different moieties. For example, a method and 

20 composition of the invention may comprise a disulfide linked dimer comprising a 
cerberus protein subunit and a subunii from one of the "BMP" proteins. Thus, the 
present invention includes a purified cerberus-like polypeptide which is a heterodimer 
wherein one subunit comprises an amino acid sequence of SEQ ID N0;2 or SEQ ID 
NO: 8, and one subunit comprises an amino acid sequence for a bone morphogenetic 

25 protein selected from the group consisting of BMP-2, BMP-3, BMP-4, BMP-5, BMP-6, 
BMP-7, BMP-8, BMP-9, BMP- 10, BMP-1 1, BMP- 12 or BMP- 13, disclosed in PCT 
application WO 95/16035, or BMP-15, disclosed in PCT application WO96/36710 or 
BMP- 16, disclosed in co-pending patent application serial number 08/715/202, filed 
September 18, 1996. A further embodiment may comprise a heterodimer of cerberus 

30 moieties, for example, of Xenopus cerberus and a mammalian homologue of Xenopus 
cerberus or other cerberus-like protein. Further therapeutic methods and compositions 
of the invention therefore comprise a therapeutic amount of at least one mammalian 
cerberus protein of the invention with a therapeutic amount of at least one other protein, 
such as a member of the TGF-P superfanfiily of proteins, which includes the bone 

35 morphogenetic proteins (BMPs), growth and differentiation factors (GDFs) and other 
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5 proteins. The composition may include other agents and growth factors such as 
epidermal growth factor (EGF). fibroblast growth factor (FGF), platelet derived growth 
factor (PDGF), transforming growth factors (TGF-a and TGF-p), Wnts, hedgeghogs, 
including sonic, indian and desert hedgehogs, activins, inhibins, and k-fibroblast growth 
factor (kFGF), parathyroid hormone (PTH), leukemia inhibitory factor 
10 (LIF/HILDA/DIA), insulin-like growth factors (IGF-I and IGF-II). Portions of these 
agents may also be used in compositions of the present invention. 

The preparation and formulation of such physiologically acceptable protein 
compositions, having due regard to pH, isotonicity, stability and the like, is within the 
skill of the art. The therapeutic compositions are also presenUy valuable for veterinary 
15 applications due to the lack of species specificity in cerberus proteins. Particularly 
domestic animals and thoroughbred horses in addition to humans are desired patients for 
such treatment with the cerberus proteins of the present invention. 

The therapeutic method includes administering the composition topically, 
systemically, or locally as by injection or implantation. When administered, the 
20 therapeutic composition for use in tiiis invention is, of course, in a pyrogen-fi^. 
physiologically acceptable form. Further, the composition may desirably be encapsulated 
or injected in a viscous form for delivery to the site of neurons and/or related neural cells 
and tissues such as Schwann cells, glial cells and astrocytes or other tissue damage. 
Topical administration may be suitable for wound healing and tissue repair. 
25 Therapeutically useful agents other tiian the cerberus proteins which may also optionally 
be included in the composition as described above, may alternatively or additionally, be 
administered simultaneously or sequentially with the cerberus composition in the 
methods of the invention. 

For implantation, the composition preferably includes a matrix capable of 
30 ddivering mammalian cerberus proteins to the site of neurons and/or related neural cells 
and tissues such as Schwann cells, ghal cells and astrocytes or other tissue damage, 
providing a stmcture for tiie developing tissue and optimally capable of being resorbed 
into the body. The matrix may provide slow release of mammalian cerberus and/or other 
protein, as well as proper presentation and appropriate environmem for cellular 
35 infiltration. Such matrices may be formed of materials pr^semly in use for other 
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5 implanted medical applications. The choice of matrix material is based on 
biocompaiibility. biodegradability, mechanical properties, cosmetic appearance and 
interface properties. The particular application of the mammalian cerberus compositions 
"will define the appropriate formulation. 

The dosage regimen will be determined by the attending physician considering 
10 various factors which modify the action of the mammalian cerberus protein, e.g. amount 
of tissue desired to be formed, the site of tissue damage, the condition of the damaged 
tissue, the size of a wound, type of damaged tissue, the patient's age, sex, and diet, the 
severity of any infection, time of administration and other clinical factors. The dosage 
may vary with the type of matrix used in the reconstitution and the types of mammalian 
15 cerberus proteins in the composition. The addition of other known growth factors, such 
as IGF I (insulin like growth factor 1), to the final composition, may also effect the 
dosage. 

Progress can be monitored by periodic assessment of tissue growth and/or repair. 
The progress can be monitored, for example, x-rays, histomorphometric determinations 

20 and tetracycline labeling. 

The following examples illustrate practice of the present invention in recovering 
and characterizing mammalian cerberus protein and employing the DN A to recover other 
cerberus proteins, obtaining the human proteins and expressing the proteins via 
recombinant techniques. As will be recognized, numerous variations of the materials and 

25 methods described can be prepared and are within the invention. 

Example 1: Cloning of a Murine Homologue to Xenopus cerberus 

The carboxy-terminal, cysteine rich domain of xenopus cerberus was used to 
screen manamalian libraries and ESTS reported in GENBANK. An EST, AA 120 122, 
derived from the Beddington day 7.5 embryonic region library was identified by sequence 

30 homology to xenopus cerberus. While the partial clone has almost no sequence 
conservation in the N-terminal half of the molecule, the C-terminal domain and especially 
the pattern of the nine cysteines showed sequence conservation. The full-length cDNA 
encoding the murine cerberus-Wkt protein was isolated from a murine embryonic cell 
cDNA library and the gene isolated from a murine genomic library. 
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5 Murine cerbems-like cDNA (SEQ ID NO: 1 ) encodes a 272 amino acid protein 

with predicted MW 30.5 Kd (SEQUENCE ID NO:2). The murine cerbems-like gene, 
Mcerb-1, has a single 2Kb intron at position 564 of the cDNA SEQ ID N0:1. The 
predicted protein contains a hydrophobic signal sequence at its amino terminus, 
indicating the molecule is secreted. Analysis by Sigcleave indicates that the first 17 
1 0 resides are cleaved from the mature molecule upon secretion (Sigcleave score = 7.6). The 
mature murine cerberus-like protein is predicted to be a 255 residue protein of MW 28.6 
Kd including residues 18-272 of SEQ ID N0:2. The cerberus-like cDNA was expressed 
in COS cells and labeled with «S-Met/^'S-Cys. The resulting protein was secreted into 
the culture medium and resulted in a smeared band of MW 38-44 Kd when fractionated 
15 on 16% polyacrylamide reducing gels. The smeared bands indicate that the protein is 
glycosylated consistent with the two putative N-linked glycosylation sites in the 
sequence. Expression of the protein in reticulocyte systems shows a 33kD band 
consistent with predicted protein in the absence of glycosylation. When fractionated on 
non-reducing polyacrylamide gels, approximately half of the COS expressed material 
20 runs with a molecular weight of 78-84 Kd indicating the protein is capable of forming 
cysteine-linked homodimers. The protein sequence contains a hydrophobic signal 
sequence at its amino terminus and a cysleine-rich domain close to its caiboxy terminus. 
The cysteine-rich domains of Xenopus cerberus and murine cerberus-like proteins are 
58% identical and the overall amino acids are 3 1% identical. The nine cysteine pattern 
25 at the carboxy terminus is conserved between Xenopus cerberus and mammalian 
cerberus-like protein. 

Example 2: Mammalian cerberus- like Encnriw a Secreted PrAfPin 

To test whether the full length cDNA encoded a secreted protein, the 293T human 
cell line was transiently transfected with the DNA sequence of SEQ ID NO. l, which 

30 encodes mammalian cerberus-like protein, cloned in an eukaryotic expression vector and 
labeled with ^^S-methionine. A broad band was secreted into the culture medium, as well 
as a minor form. Similar to the COS cell expression, fractionation of the material 
expressed in 293T cells evidenced the formation of cysteine-linked dimers. The 
cerberus-like protein is presumably glycosylated since the protein translated in the 

35 reticulocyte system (in the absence of membranes) results in a band of 33 kD molecular 
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5 weight corresponding to the molecular weight predicted from the amino acid sequence. 
Cerberus-like protein has two putative N-linked glycosylation sites. Accordingly, the 
DNA of SEQ ID NO: 1 defines a secreted protein with high amino acid identity to 
" Xenopus cerberus in the cysieine-rich domain. 
Example 3: Expression of Mammalian Cerberus-like Protein 

10 In prestreak mouse embryos (5.5 days post coitum), manmialian cerberus-like 

transcripts were detected on one side of the primitive endoderm, including the distal tip 
of the embryo. At early primitive streak, expression was found in a patch of primitive 
endoderm cells on one side of the embryonic region and no longer extends to the tip of 
the embryo. This patch corresponded to the anterior side of the embryo, because in 

15 sections it is found in the endoderm opposite to the forming primitive streak which can 
be recognized as a thickening of the posterior epiblast. At mid-streak, the cerberus-like 
positive area remained in the anterior primitive endoderm. At the late streak stage, when 
the node has reached the distal tip of the embryo, a second population of cerberus-like 
expressing was seen in the region surrounding the node. These cells presumably 

20 correspond to definitive endodermal cells, that are known to emerge from the node. The 
endodermal nature of this cell population was confirmed by histological analysis. At the 
neural plate stage, cerberus-like is found underiying the anterior neural plate, in a pattern 
comparable to the domain of Otx2 expression in endoderm. Ang et al., Cell , 78:561-574 
(1994). At this stage, the cerberus-like positive cell population presumably consists of 

25 both primitive and definitive endoderm, and do not include the node itself. Imponantly, 
cells expressing the cerberus-like secreted factor are in direct contact with cells that 
subsequently give rise to the fore- and midbrain region of the CNS. 

At the early headfold stage, cerberus-like signal in the endoderm stats to weaken. 
At late headfold, cerberus-like expression is confined to the midline and adjoining 

30 endoderm. Expression is found in all cells of the midline from the rostral end of the 
embryo to the proximity of the node and includes anterior gut endoderm and mesoderm 
from the prechordal and notochordal plates. Expression of cerberus-like mRNA in 
anterior endoderm remains until the start of somitogenesis and then becomes 
undetectable. Concomitantly, a late phase of expression begins in the mesoderm of the 

35 somites. Thus, cerberus-like defines an anterior domain of the endoderm in early mouse 
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5 embryos. Expression is found in anterior endoderm in direct contact with the future 
neural plate, but never detected in posterior endoderm cells in contact with the primitive 
streak, providing further evidence in support of the role oi cerberus-like in the induction 
of anterior neuroectoderm. 

Example 4: Assays m Defermine Function r..^,ru. or r../..„»./.^. ;^ .u. 
10 Mouse 

To determine function of cerberus and cerberus-like genes in the mouse embryo, 
transgenic knockout and misexpression mice can be made by engineering of murine 
embryonic stem (ES) cells and injection into the blastula using standard procedures. The 
murine knockout can be achieved by replacing a central portion of the cerbems or 
15 cerberus like genes with a selectable marker ie.g., neo), transfecting the constmct into ES 
cells and selecting for the double crossover. For misexpression. the cerberus and 
cerberus-m genes can be engineered to be expressed from generic (e.^.. actin) or tissue 
specific {e.g., mx) promoters and reintroduced into murine ES cells. Transgenic animals 
can be generated from such engmeered cells using classical procedures. 

To determine the function of the cerberus or cerbenis like proteins in the adult 
mouse, the protein can be directly injected into tissues or delivered by viral vectors For 
example, the cerberus or cerberus-like genes can be transiently overexpressed in the adult 
mouse using adenovirus vectors and the function or activity of the proteins investigated 
by physiological, histochemical and biochemical analysis of the animals. 
Examples: Tsolation of H„m.n r..^ erus «nH r..A.».. ^,„r_ 

Hybridization of the murine cerberus-like gene to human DNA sequences 
.dentified a single Xbal generated band of approximately 15 Kb demonstrating its utility 
as probe to identify the human cerberus-like gene within a human genomic library 
Alignment of theXe«op«. cerberus and murine cerberus-like genes has identified three 
30 regions of sufficient nucleic acid homology within the cysteine rich domain to serve as 
probes for the human cerbems and cerberus-like genes. The consensus sequences for the 
three probes are: 

(1) TGCCCrrCAMYCAGAMYATTGYMCATGAARACTGT [SEQ ID NO-3]- 

(2) CAGAACAAYCTKTGCrrrGGTAAATGCA [SEQ ID NO-4]- and 
35 (3)TGYTCCCAYTGCTYGCCSWCCAAATr[SEQIDNO-5] 



20 



28 



BNSDOCIP- <WO P90155?fli 1 > 



wo 99/01553 



PCT/US98/n462 



5 The first probe spans the 2 Kb intron in the mouse gene and is therefore less likely 

to function well as a probe. Used individually or together, the latter two oligos 
containing 6-fold and 10-fold degeneracy, respectively, would serve as probes for human 
" cerberus and cerberus-like genes in genomic or cDNA libraries. 

The nucleotide and amino acid sequences obtained for human cerberus according 
10 to the procedures described herein are shown in SEQ ID N0:7 and SEQ ID NO:8, 
respectively. 

Example 6: Identification and Isolation of Additional Cerberus and Ce rberus-like 
Proteins^ 

The cerberus and cerberus-like proteins are members of a family which can be 

1 5 recognized by their unique cysteine pattern. This family includes cerberus, cerberus-like 
protein, Dan protein and Norrie protein, Dan protein is a mmor suppressor candidate and 
defects in the Norrie protein result in congenital defects including blindness and deafness. 
Members of the cerberus family thus seem to play an important role in cell differentiation 
and proliferation and thus it is important to find other members of this family of proteins. 

20 Family members cannot necessarily be recognized by amino acid homology, which 
demonsurates a significant amount of diversity, but can be recognized by a unique pattern 
of nine cysteines. The consensus cysteine pattern in each of these proteins is: C (X,3.i5) 
C (X9) CxGxC (X,4.23) CXXC (X^j) C (X,^,^) CXC, wherein X^ indicates the number of 
non-cysteine amino acid residues found between each cysteine in the conserved cysteine 

25 pattern of the cerberus and cerberus-like families of proteins. 

By searching mammalian ESTs for this motif, novel members of the cerberus 
family have been identified including the human EST N35377 and the murine EST 
AA289245. Using standard procedures, the ftill length genes can be isolated from either 
genomic or cDNA libraries. These genes are expected to encode signaling proteins which 

30 function to pattern the embryo, control cellular differentiation or cell proliferation and 
thus be candidate proteins in medical therapy. 

Within the cysteine-rich domains of Xenopus cerberus and murine cerberus-like 
proteins are two discrete subregions which are highly conserved cysteine motifs and may 
be particularly useful for the identification and isolation of cerberus and cerberus-like 

35 proteins from other species, as well as related members of the cerberus and cerberus-like 
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5 protein families. Use of these highly conserved motifs to screen mammalian libraries and 
ESTS reported in GENB ANK is expected to identify additional proteins of the cerberus 
and cerberus-like families of proteins from humans and other species, as well as identify 
further family members may be found. The first region is the motif C-X-G-X-C, which 
corresponds to Cys-Phe-Gly-Lys-Cys found at amino acid residues 186 to 190 of SEQ 
10 ID NO: 1 . Thus, degenerate oligonucleotides to the sequence TGC TTT GGC AAA TGC 
at nucleotide positions 613 to 627 of SEQ ID N0:1, and the adjoining regions, may be 
useful to identify and isolate other genes which share the C-X-G-X-C moiif of cerberus 
and cerberus-like proteins. A second highly conserved region within the cysteine-rich 
domains of Xenopus cerberus and murine cerberus-like proteins is the motif C-X-X-C, 
1 5 which corresponds to Cys-Ser-His-Cys found at amino acid residues 206 to 209 of SEQ 
ID N0:1. Thus, degenerate oligonucleotides to the sequence TGC TTC CAC TGC at 
nucleotide positions 673 to 684 of SEQ ID N0:1, and the adjoining regions, may be 
useful to identify and isolate other genes which share the C-X-X-C motif of c^rW and 
cerberus-like proteins. Using the above motifs, additional proteins of the cerberus and 
20 cerberus-like families of proteins may be identified from other species, and further family 
members from humans, mice or frogs may be found. Use of the two motifs above in 
concert may further provide specific identification and isolation of additional cerberus 
and cerberus-like family members. 

Example 7; Mtcroiniertion Assays fnr r^rh ^rus anri r^rh.r..,.uke Activity 

25 Injection of mRNA encoding xenopus cerberus into xenopus blastomeres at 

different stages (4 cell, 8 cell and 32 cell) has profound effects on the expression pattern 
of specific genes and on the resulung morphology of the developing embryo. 
Misexpression of cerberus inhibits development of the prechordal plate, notochord and 
ventral trunk mesoderm and their respective molecular markers, goosecoid, collagen H 

30 and x-globin. Microinjection of cerberus induces anterior neuroectodermal structures 
such as brain, olfacto^r placodes and cement gland. Cerberus induced gene expression 
includes N-CAM (brain), Otx2 (anterior brain), CG-J3 (cement gland) and Nkx-2.5 (heart 
primordium). Induction of neural tissue by cerberus was specific to anterior region of 
the brain as indicated by upregulation of the Otx2 marker but not more posterior markers 
35 including En.2 (midbrain-hind bram junction), Krox-20 (hind brain) and XlHbox-6 
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5 (spinal cord). Injection of cerberus into specific cells of the 32-cell blasiomere resulted 
in the induction of ectopic heads, and duplicate heart and liver. 

Like cerberus. microinjection of cerberus-like mRNA into animal cap explants 
' induces anterior CNS in xenopus embryos. However, microinjection of mammalian 
cerberus-like mRNA into xenopus embryos did not induce formation of ectopic head 
10 structures, for example, containing forebrain, cyclopic eyes, olfactory placodes and 
cement glands, suggesting overlapping but not identical functional effects. Thus, 
cerberus-like is a neuralizing factor, that leads to the formation of forebrain in xenopus 
assays. 

Example 8. Additional Embryonic Stem Cell Assay 

15 In order to assay the effects of the cerberus-like proteins of the present invention, 

it is possible to assay the growth and differentiation effects in vitro on a number of 
available embryonic stem cell lines. One such cell line is ES-E14TG2, which is available 
from the American Type Culture Collection in Rockville, Md. 

In order to conduct the assay, cells may be propagated in the presence of 100 units 

20 of LIF to keep them in an undifferentiated state. Assays are setup by first removing the 
LIF and aggregating the cells in suspension, in what is known as embryoid bodies. After 
3 days the embryoid bodies are plated on gelatin coated plates (12 well plates for PCR 
analysis, 24 well plates for immunocyiochemistry) and treated with the proteins to be 
assayed. Cells are supplied with nutrients and treated with the protein factor every 2-3 

25 days. Cells may be adapted so that assays may be conducted in media supplemented with 
15% Fetal Bovine Serum (FBS) or with CDM defined media containing much lower 
amounts of FBS. 

At the end of the treatment period (ranging from 7-21 days) RNA is harvested 
from the cells and analyzed by quantitative multiplex PCR for the following markers: 
30 Brachyury, a mesodermal marker, AP-2, an ectodermal marker, and HNF-3a an 
endodermal marker. Through immunocytochemistry, it is also possible to detect the 
differentiation of neuronal cells (glia, astrocytes and neurons), muscle cells 
(cardiomyocytes, skeletal and smooth muscle), and various other phenotype markers such 
as proteoglycan core protein (cartilage), and cytokeratins (epidermis). Since these cells 
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5 have a tendency to differentiate autonomously when LIF is removed, the results are 
always quantitated by comparison to an untreated control. 
Example 9 ■ EvprP« sion of Hnnmn Cerherux Prnf«.{» 

The human Cerberus cDNA sequence (SEQ ID N0:7) has been expressed in COS 
cells using mammalian expression vector, as described in the application. The observed 
10 protein was secreted, with a molecular weight of approximately 35-45 kD, consistent 
with a glycosolated protein of 250 (amino acids #18 to #267 of SEQ ID NO:8) amino 
acids in length. 

The foregoing descriptions detail presently preferred embodiments of the present 
invention. Numerous modifications and variations in practice thereof are expected to 
15 occur to those skilled in the art upon considerarion of these descriptions. Those 
modifications and variations are believed to be encompassed within the claims appended 
hereto. 

All of the publicadons and patents referred to herein are hereby specifically 
mcorporated by reference, as if fully set forth herem for the referred to disclosure. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(I) APPLICANT: FOLLETTIE, MAXIMILLIAN 
DeROBERTIS, EDWARD M. 

(ii) TITLE OF INVENTION: Mammalian Cerberus-Like Protein & 
Compositions 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: US 

(F) ZIP: 02140 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: TBD 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: LAZAR, STEVEN R 

(B) REGISTRATION NUMBER: 32,618 

(C) REFERENCE/DOCKET NUMBER: GI 5290APCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8260 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID N0:1: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 5 8.. 873 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GAATTCGGCC AAAGAGGCCT ATGTGAATCT AACCTCAGTC TCTGGGAATC AGGAAGC 57 
ATG CAT CTC CTC TTA GTT CAG CTG CTT GTT CTC TTG CCT CTG GGG AAG 105 
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Met His Leu Leu Leu Val Gin Leu Leu Val Leu Leu Pro Leu Gly Lys 

^ 5 10 .15 

GCA GAC CTA TGT GTG GAT GGC TGC CAG AGT CAG GGC TCT TTA TCC TTT 
Ala ASP Leu Cys Val Asp Gly Cys Gin Ser Gin Gly Ser leu Ser pS 
20 25 30 

Pro Leu Zt f° '^^^ ^^^^^C GTG GCC AAC CAC GAG 

Pro Leu Leu Glu Arg Gly Arg Arg Asp Leu His Val Ala Asn His Glu 

40 45 

n^^ CTG TTT GTG GCC GTG CCA CAC CTC ATG 

Glu Ala Glu ASP Lys Pro Asp Leu Phe Val Ala vll Pro hS Leu Met 

55 60 

GGC ACC AGC CTG GCT GGG GAA GGC CAG AGG CAG AGA GGG AAG ATG CTG 
Gly Thr ser Leu Ala Gly Glu Gly Gin Arg Gin Arg G^y JJs uTt Leu 

^° 75 80 

TCC AGG CTT GGA AGA TTC TGG AAG AAA CCT GAG ACC GAA TTT TAC err 
ser Arg Leu Gly Arg Phe Trp Lys Lys Pro Glu Jhr G^^ Pro 
85 90 95 



200 205 



153 



201 



249 



297 



345 



Z X Z SI 51^ S S ^ S 

s s= ^ s z ?i? - s SI s s= ir. 
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S s z z Z. S S ^ ^. Z I?, s 

a= s s s s s S s 
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^? s s 5- s z ?s ti: s SI - SI 
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TCC CAG GGT TCC TTC ATC CCT GGA CTT CCA GCT TCA AAA AC A AAC CCA 873 
Ser Gin Gly Ser Phe lie Pro Gly Leu Pro Ala Ser Lys Thr. Asn Pro 
260 265 270 

TGATTACCTC AACAGAAAGC AAAACCTCAA CAGAATAAGT GAGGGTTATT CAATCTGGAA 933 

ATGTTATGTG AGTTATATAA AGATCAGTGG AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 993 

AAGCGGCCGC 1003 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met His Leu Leu Leu Val Gin Leu Leu Val Leu Leu Pro Leu Gly Lys 
15 10 15 

Ala Asp Leu Cys Val Asp Gly Cys Gin Ser Gin Gly Ser Leu Ser Phe 
20 25 30 

Pro Leu Leu Glu Arg Gly Arg Arg Asp Leu His Val Ala Asn His Glu 
35 40 45 

Glu Ala Glu Asp Lys Pro Asp Leu Phe Val Ala Val Pro His Leu Met 
50 55 60 

Gly Thr Ser Leu Ala Gly Glu Gly Gin Arg Gin Arg Gly Lys Met Leu 
65 70 75 80 

Ser Arg Leu Gly Arg Phe Trp Lys Lys Pro Glu Thr Glu Phe Tyr Pro 
85 90 95 

Pro Arg Asp Val Glu Ser Asp His Val Ser Ser Gly Met Gin Ala Val 
100 105 110 

Thr Gin Pro Ala Asp Gly Arg Lys Val Glu Arg Ser Pro Leu Gin Glu 
115 120 125 

Glu Ala Lys Arg Phe Trp His Arg Phe Met Phe Arg Lys Gly Pro Ala 
130 135 140 

Phe Gin Gly Val He Leu Pro He Lys Ser His Glu Val His Trp Glu 
145 150 155 160 

Thr Cys Arg Thr Val Pro Phe Asn Gin Thr He Ala His Glu Asp Cys 
165 170 175 

Gin Lys Val Val Val Gin Asn Asn Leu Cys Phe Gly Lys Cys Ser Ser 
180 185 190 

He Arg Phe Pro Gly Glu Gly Ala Asp Ala His Ser Phe Cys Ser His 
195 200 205 

Cys Ser Pro Thr Lys Phe Thr Thr Val His Leu Arg Leu Asn Cys Thr 
210 215 220 

35 
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Ser Pro Thr Pro Val Val Lys Met 
225 230 

Cys Met Val Lys Thr Glu Arg Gly 
245 

Ser Gin Gly Ser Phe He Pro Gly 
260 



Val Met Gin Val Glu Glu Cys Gin 

235 240 

Glu Glu Arg Leu Leu Leu Ala Gly 

250 255 

Leu Pro Ala Ser Lys Thr Asn Pro 

265 _ 270 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGCCCTTCAM YCAGAMYATT GYMCATGAAR ACTGT 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CAGAACAAYC TKTGCTTTGG TAAATGCA 

28 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGYTCCCAYT GCTYGCCSWC CAAATT 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 117.. 623 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 624.. 2402 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2403.. 2699 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

TCCGGCCAGG CAGGTATCTA TATATCCGAT TTCCTTTTTC CCAAGTCCTG CAGAAGAATG 60 

AGCCTCTCCT TTGGGCCTCA TCATTTTACC AAAAAGAAGC TTGGGCCCCT GACAGCATGC 120 

ATCTCCTCTT ATTTCAGCTG CTGGTACTCC TGCCTCTAGG AAAGACCACA CGGCACCAGG 180 

ATGGCCGCCA GAATCAGAGT TCTCTTTCCC CCGTACTCCT GCCAAGGAAT CAAAGAGAGC 240 

TTCCCACAGG CAACCATGAG GAAGCTGAGG AGAAGCCAGA TCTGTTTGTC GCAGTGCCAC 300 

ACCTTGTAGC CACCAGCCCT GCAGGGGAAG GCCAGAGGCA GAGAGAGAAG ATGCTGTCCA 3 60 

GATTTGGCAG GTTCTGGAAG AAGCCTGAGA GAGAAATGCA TCCATCCAGG GACTCAGATA 420 

GTGAGCCCTT CCCACCTGGG ACCCAGTCCC TCATCCAGCC GATAGATGGA ATGAAAATGG 4 80 

AGAAATCTCC TCTTCGGGAA GAAGCCAAGA AATTCTGGCA CCACTTCATG TTCAGAAAAA 540 

CTCCGGCTTC TCAGGGGGTC ATCTTGCCCA TCAAAAGCCA TGAAGTACAT TGGGAGACCT 600 

GCAGGACAGT GCCCTTCAGC CAGGTATGTG TTCTGGGGGG AGAGCAGGTA AGAGTTTGCA 660 

GGTGGTAGTG GACAGCTGGG ATGGATGGAG AGTAGGGGAA AAGGCTGTCA GGAGCCTGAC 720 

TCTAGCTTAA CTACAGATTT GGTCCTTGGG CATTCATCAT AGGATTTGGC AAAGATTAAG 780 

TTTCCTTCTG GCCTTTCCAT TTTTTCTTGG CATTGTGGAA ATGCTGCAAG AATGATATGA 840 

TGATACTGTC AATATCAGTA ATCATTCATT CACACTGAAG ACACAGAGCT CTGTTTTATT 900 

TATTTATTTT TGCATTGGAG GTGATCTACT CAGAGATATA AGTCAGACTG TACCCTCAGT 960 

TAGGAAACTG AGAATTTAGA GAATCACCAG AACTCCTCTG TAGCTATCTT TCTGCACTCT 1020 

ATTAATATGG GATGAGCAGG TCAACTCCCA TTTGTTGATA AAGTGGGGTG CATTGGACTC 1080 

CTTCCCAAAT ACTCTCATAT CCATTTACGA TGGTCTTAAT CCCCATAGTC CATACTTAAT 1140 
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TACTTTATAG 
AAAGTACGTT 
AAAAGTGATT 
TGAGATCATA 
TTTGTAGTTA 
TCCTCTGGCA 
TTAAACATGA 
AATGCTACCC 
ATTCTGCCTG 
CTACTCTGTT 
TTTTGAAATT 
TTACATATTT 
GTAATATNCA 
ACTTGAATGT 
TATTCTACGA 
GGAATAAAAA 
GCTCACATCC 
TTTTTTATGT 
ATTTAATTGA 
TCCAAAATTG 
TGCATTATGT 
AGACTATAAC 
AATGCGGGTC 
GTTTGCCTGC 
TGATCAAGGT 
ATGGACACAT 
GAGCTATCCC 
AGTCTGAAAA 
TTTAAATTAG 
TTGTCTCAGC 
TAAAGATTTT 
TACTAATATT 



GTTTATGAGG 
GAGGTTCTCA 
TTCTTTCACT 
TCTCCTCCCC 
TTATGATCAA 
ATATCTACAT 
AGATATTGAA 
AATTAAGCAA 
AAAAGGCAAG 
GGCCATGTGC 
CTTCTTTTGA 
TGAAAGGCAT 
ATGATTTGCA 
ACTACAGGTC 
TCAATGTAAC 
AGTAAGGCAC 
AATTAGACAA 
TAACTTAGGG 
GGTGGTTCTG 
GAAAGCTTTG 
GTGTTAAAGA 
CCACGAAGGC 
TGTTCATTTT 
CAAGTTCACC 
GGTGATGCTG 
CCTACATGCT 
ACTATTACCT 
TGTTAAGTGG 
AACGATAGAG 
TCTGCCACTA 
CATCTATAAA 
CCGTGATGCA 



GACTTCTTTA 
GGCAAAAGTT 
TATTTTCTCA 
TTAGAAGTAC 
TTCCATGCTA 
TCCAAATGTT 
CCAAAAACAT 
GCAATAGAAT 
GATATGTAGC 
AATTAGAAAA 
ATTCTATTGC 
ATAAAGCTAG 
GGCATATGCA 
TTGCAGAATA 
AAATATGTAT 
ATTACTTATG 
AATTGCTTAA 
ATTCCCTAAA 
TGCCCTTATA 
AAATTTTTAA 
AATAATTCAA 
TGTGAAAAAG 
CCTGGAGCCG 
ACGATGCACT 
GTGGAGGAGT 
GGCTCCCAGG 
TTGAAAAGCA 
GTACATAACA 
GAAATGATAT - 
ACTAGCCGTA 
TGACGGACCC 
TTTTCTCCAA 



ATAGCTTGCT 
GTCATATCAT 
TATGAGCTTT 
CTTTCTCCTG 
TTAAGACAAA 
AAATTAAAAT 
GCAGGGTAGA 
AGGGCAATTG 
AATGGCAAGT 
TTACCCCTAA 
TAATTAAATT 
GTGCTTGGAG 
TTGTAACTCT 
GGATAGAATT 
TGGGAGCCTA 
TAAGATAATT 
GGTTTTGCAC 
GGCTGTTTAA 
GATACCATCA 
ATTATCCTCA 
AATAACGTAA 
TAGTTGTTCA 
CGCAGCACTC 
TGCCACTGAA 
GCCAGTGCAA 
ATTCCTTTAT 
AAACCACAAC 
TTTTCAGGGA 
- TAGTCTAGTT 
GAATGTTAAG 
GACCTAGATG 
GTTTGGGTAA 



AAAGCTTATC 
TTCTAGTATT 
TTAAAAAATC 
ATTCATGTTG 
GGGACATCCT 
TGAGAACTTG 
GTAAAATTTT 
ACTGTTCAAG 
CAATTATCAA 
GAATCAGGCA 
AAAACTAAGA 
TTATGAGAGG 
GCTTGCATAC 
AAACCTAGAA 
CTATGCACAA 
ACCATTAGAA 
GAATAATGTA 
TAATTTACTC 
CTTGCATATT 
GATTTACAGT 
TGGAAATGTG 
GAACAACCTT 
CCATACCTCC 
CTGCACTGAA 
GGTGAAGACG 
CCCAGGAGTT 
AGCAAAGATG 
AAGGTGACTT 
ATTGGTACAC 
TTGTAAAACC 
ATTGCTAAAA 
AAGCCCTCCA 



CCACAACCTC 
ATGATAGCAA 
AATCTTGATG 
TGTTGGCTGA 
ACTGTCTACT 
CATTAGGTCC 
ATAGTCGAGT 
GCAGTTAAGT 
ATAATAATGA 
ATCAAATTTC 
TGTTTGACTC 
TAAAGGTGAT 
AACTTCATAG 
TGTTCTGATC 
AGCCCTGTGA 
TTTTTCAATC 
GAGTTAAATA 
AATAAAGAAA 
GCAAATTGTA 
CCATAGCTTC 
TTTGCTTTTT 
TGCTTTGGGA 
TGCTCTCACT 
CTTTCCTCCG 
GAGCATGAAG 
TCAGCTTGAA 
CTGATTATTC 
GAAACGTAGT 
GTTTGAGACC 
TTTCTCCATC 
TCCTTTCCAC 
TCTAAAGAAG 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 
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GAAAAGAAAT AAGCGAGACC ATAAAAATGG GCTTCTTTAA TGTGTGTCAA ATCACCAGCA 3120 

AGCAAAGAAG CAAGATAGAG AGGGAGGAAG GAAGGAAGGA AGGAAGGAAA GAAGGAAGGA 3180 

AGGAAGGAAG GAAGGAAGGA AGGAAGGAAG GAAGGAAGGA AGGAAGGCAG GCAGGCAGGC 3240 

AGGCAGGCAG GCAGGCAGGC AGGCAGGCAG GGAGGCAGGC TACGTGAAAT ATTTGTAGGA 3300 

AAGATTCTCA TACTTATAGT TACTTTTGCA ACCCAAACAG TGTTTTACTT GACTTCTATC 3360 

TGATGATTAA GTCTTTCCAC AGATGTAAGG AGTAACTTGC TTGGTTGCCT CCTTTTTAAC 3420 

AATACTCCTC ATATAAAGTA CTTAATGTCA GGTCTCTGAC TTTGAAGAAG GAACAGTGAT 3480 

GTTAATTTTA GTAGTTTATA TAGGGAAGAG GAACAATCAC TGGTAGCCAA ACAAGTACCT 3540 

ATATTATGAG GAAGGAAAAA TACATGACTA CTACCAGGTT TAGAGATCCG AATTC 3595 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..804 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

ATG CAT CTC CTC TTA TTT CAG CTG CTG GTA CTC CTG CCT CTA GGA AAG 48 
Met His Leu Leu Leu Phe Gin Leu Leu Val Leu Leu Pro Leu Gly Lys 
1 5 10 15 

ACC ACA CGG CAC CAG GAT GGC CGC CAG AAT CAG AGT TCT CTT TCC CCC 96 
Thr Thr Arg His Gin Asp Gly Arg Gin Asn Gin Ser Ser Leu Ser Pro 
20 25 30 

GTA CTC CTG CCA AGG AAT CAA AGA GAG CTT CCC ACA GGC AAC CAT GAG 144 
Val Leu Leu Pro Arg Asn Gin Arg Glu Leu Pro Thr Gly Asn His Glu 
35 40 45 

GAA GCT GAG GAG AAG CCA GAT CTG TTT GTC GCA GTG CCA CAC CTT GTA 192 
Glu Ala Glu Glu Lys Pro Asp Leu Phe Val Ala Val Pro His Leu Val 
50 55 60 

GCC ACC AGC CCT GCA GGG GAA GGC CAG AGG CAG AGA GAG AAG ATG CTG 240 
Ala Thr Ser Pro Ala Gly Glu Gly Gin Arg Gin Arg Glu Lys Met Leu 
65 70 75 80 

TCC AGA TTT GGC AGG TTC TGG AAG AAG CCT GAG AGA GAA ATG CAT CCA 288 
Ser Arg Phe Gly Arg Phe Trp Lys Lys Pro Glu Arg Glu Met His Pro 
85 90 95 

TCC AGG GAC TCA GAT AGT GAG CCC TTC CCA CCT GGG ACC CAG TCC CTC 33 6 

Ser Arg Asp Ser Asp Ser Glu Pro Phe Pro Pro Gly Thr Gin Ser Leu 
100 105 110 
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ATC CAG CCG ATA GAT GGA ATG AAA ATG GAG AAA TCT CCT CTT CGG GAA 
II. .;in Pro lie Asp Gly Met Lys Met Glu Lys Ser Pro Leu Sg 

120 

GAA GCC AAG AAA TTC TGG CAC CAC TTC ATG TTC AGA AAA ACT CCG GCT 
Glu la Lys Lys Phe Trp His His Phe Met Phe Arg Jhr Pro III 

140 

S 0=?= s s S S It s s s 

150 ^gjj 

?S ?S ?S S S a= r S S 5- ^ ^ 

165 170 

GAA AAA GTA GTT GTT CAG AAC AAC CTT TGC TTT GGG AAA TGC crr -vrT 
Glu Lys Val Val Val Gin Asn Asn Leu Cys Phe fly ^ ^ 

185 

vS SI SI S S s s s I - - 

s SI K= - s ?s ^i? s s .^„= jsi 

220 

SI s s SI? s - ™ s s Si 

?g s= - - - - s c„ 

250 255 
TCC CAG GAT TCC TTT ATC CCA GGA GTT TCA GCT TGA 
Ser Gin Asp Ser Phe He Pro Gly Val Ser Ala * 
260 265 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met His Leu Leu Leu Phe Gin Leu Leu Val Leu Leu Pro Leu Gly Lys 

1° 15 
Thr Thr Arg His Gin Asp Gly Arg Gin Asn Gin Ser Ser Leu Ser Pro 

2^ 30 
val Leu Leu Pro Arg Asn Gin Arg Glu Leu Pro Thr Gly Asn His Glu 

Glu Ala Glu Glu Lys Pro Asp Leu Phe Val Ala Val Pro His Leu Val 



384 



432 



480 



528 



576 



624 



672 



720 



768 



804 



55 60 
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Ala Thr Ser Pro Ala Gly Glu Gly Gin Arg Gin Arg 
65 70 75 

Ser Arg Phe Gly Arg Phe Trp Lys Lys Pro Glu Arg 
85 90 



Glu Lys Met Leu 
80 

Glu Met His Pro 
95 



Ser Arg Asp Ser Asp Ser Glu Pro Phe Pro Pro Gly 
100 105 



Thr Gin Ser Leu 
110 



lie Gin Pro lie Asp Gly Met Lys Met Glu Lys Ser 
115 120 



Pro Leu Arg Glu 
125 



Glu Ala Lys Lys Phe Trp His His Phe Met Phe Arg Lys Thr Pro Ala 
130 135 140 



Ser Gin Gly Val lie Leu Pro lie Lys Ser His Glu 
145 150 155 



Val His Trp Glu 
160 



Thr Cys Arg Thr Val Pro Phe Ser Gin Thr lie Thr 
165 170 



His Glu Gly Cys 
175 



Glu Lys Val Val Val Gin Asn Asn Leu Cys Phe Gly 
180 185 



Lys Cys Gly Ser 
190 



Val His Phe Pro Gly Ala Ala Gin His Ser His Thr 
195 200 



Ser Cys Ser His 
205 



Cys Leu Pro Ala Lys Phe Thr Thr Met His Leu Pro Leu Asn Cys Thr 
210 215 220 



Glu Leu Ser Ser Val He Lys Val Val Met Leu Val 
225 230 235 



Glu Glu Cys Gin 
240 



Cys Lys Val Lys Thr Glu His Glu Asp Gly His He 
245 250 



Leu His Ala Gly 
255 



Ser Gin Asp Ser Phe He Pro Gly Val Ser Ala 
260 265 



41 



BNS0OCl{> <VVO I > 



wo 99/01553 



PCT/US98/11462 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(POT Rule I3bts) 





A. The indications made beiow relate to the miaoorganisra referred to in the description 


B^IDENTIFICATIONOFDEPOSrr Further deposits are identified on an additional sheet □ 


Name of depositary institution 
American Type Culture Collection 


Address of depositary instinition (including postal code and countrvf 
12301 Parklawn Drive, Rockville, Maryland, 20852, United States of America 


Date ol deposii 
11 March 1997 

C. ADDITIONAL INDICATIONS iUcye blank if no, applicab 


Accession Number 
ATCC 98347 

h) This infomaation is continued on an additional sheet Q 




D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE OfU.in,ic.uo„sar.no,fcr.l,^^c,Suuc^ 






E. SEPARATE FURNISHING OF INDICATIONS (Ica^ blank if net applicable) 






1 


^^Tljis sheet was received with the international application 




□ This sheet was received by the International Bureau on: 




Authorized officer ^ 




Authorized officer 
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What is claimed is: 

1. An isolated DNA sequence comprising a DNA sequence selected from the group 
consisting of: 

"(a) nucleotides from nucleotides beginning at # 1, 52, 55, 58, 61, 64, 67, 70. 73, 121, 
256, 259, 262, 265, 268, 271 or 484 and ending at #723 or 801 of SEQ ID N0:7; and 

(b) sequences which hybridize to (a) under stringent hybridization conditions and 
encode a protein which exhibits cerberus activity. 

2. An isolated DNA sequence comprising a DNA sequence selected from the group 
consisting of: 

(a) nucleotides encoding amino acids starting with amino acids beginning at # 1, 18 to 
25, 4 1 , 85 to 9 1 or 1 62 and ending at #24 1 or 267 of SEQ ID NO: 8; and 

(b) sequences which hybridize to (a) under stringent hybridization conditions and 
encode a protein which exhibits cerberus activity. 

3. A vector comprising a DNA molecule of claim 1 in operative association with an 
.expression control sequence therefor. 

4. A vector comprising a DNA molecule of claim 2 in operative association with an 
expression control sequence therefor. 

5. A host cell transformed with the vector of claim 3. 

6. A host cell transformed with the vector of claim 4. 

7. An isolated DNA molecule comprising a DNA sequence selected from the group 
consisting of: 

(a) nucleotide #268 to #801 of SEQ ID NO:2; and 

(b) naturally occurring allelic sequences and equivalent degenerative codon 
sequences of (a). 

8. A vector comprising a DNA molecule of claim 7 in operative association with an 
expression control sequence therefor. 

9. A host cell transformed with the vector of claim 8. 

10. An isolated DNA molecule encoding mammalian cerberus protein, said DNA 
molecule comprising nucleotide #268 to #801 of SEQ ID N0:7. 
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1 1 . An isolated DNA molecule according to claim 10, further comprising a 
nucleotide sequence encoding a suitable propeptide 5' to and linked in frame to the DNA 
coding sequence. 

12. A vector comprisingaDNA molecule of claim 11 in operative association with 
an expression control sequence therefor. 

13. A host cell transformed with the vector of claim 12. 

14. A method for producing purified mammalian cerberus protein, said method 
comprising the steps of: 

(a) culturing a host cell transformed with a DNA sequence according to claim I, 
comprising a nucleotide sequence encoding mammalian cerberus protein; and 

(b) recovering and purifying said mammalian cerberus protein from the culture 
medium. 

15. A method for producing purified mammalian cerberus protein said method 
comprising the steps of: 

(a) culturing a host cell transformed with a DNA sequence according to claim 2, 
comprising a nucleotide sequence encoding mammalian cerberus protein: and 

(b) recovering and purifying said mammalian cerberus protein from the culture 

medium. 

1 6. A method for producing purified mammalian cerberus protein said metiiod 
comprising the steps of: 

(a) culturing a host cell transformed with a DNA sequence according to claim 10, 
comprising a nucleotide sequence encoding mammalian cerberus protein; and 

(b) recovering and purifying said mammalian cerberus protein from the culture 

medium. 

17. A purified mammalian cerberus polypeptide comprising an amino acid sequence 
according to SEQ ID NO:8. 

18. A purified mammalian cerberus protein produced by the steps of 

(a) culturing a cell transformed with a DNA comprising the nucleotide sequence from 
nucleotide #268 to #801 as shown in SEQ ID N0:7; and 

(b) recovering and purifying from said culture medium a protein comprising the amino 
acid sequence from amino acid #90 to amino acid #267 as shown in SEQ ID NO:8. 
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19. A composition comprising a therapeutic amount of at least one mammalian 
Cerberus polypeptide according to claim 18. 

20. A method for altering the regulation of neuronal genes in a patient in need of same 
comprising administering to said patient an effective amount of the composition of claim 19. 

21 . A purified manmialian cerberus protein comprising the amino acid sequence 
from amino acid #1 to #267 of SEQ ID N0:8. 

22. Antibodies to a purified manmialian c^rZ^eruj protein according to claim 21. 

23. A purified mammalian cerberus protein comprising the amino acid sequence 
from amino acid #90 to #267 of SEQ ID NO:2. 

24. Antibodies to a purified mammalian cerberus protein according to claim 23. 

25. A purified mammalian cerberus protein produced by the steps of 

(a) culturing a cell transformed with a DNA comprising the nucleotide sequence from 
nucleotide #52 to #801 as shown in SEQ ID N0:7; and 

(b) recovering and purifying from said culture medium a protein comprising the amino 
acid sequence from amino acid #18 to amino acid #267 as shown in SEQ ID NO:8. 
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